首页> 外文OA文献 >An Analysis of Visual Question Answering Algorithms
【2h】

An Analysis of Visual Question Answering Algorithms

机译:视觉问题答疑算法分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In visual question answering (VQA), an algorithm must answer text-basedquestions about images. While multiple datasets for VQA have been created sincelate 2014, they all have flaws in both their content and the way algorithms areevaluated on them. As a result, evaluation scores are inflated andpredominantly determined by answering easier questions, making it difficult tocompare different methods. In this paper, we analyze existing VQA algorithmsusing a new dataset. It contains over 1.6 million questions organized into 12different categories. We also introduce questions that are meaningless for agiven image to force a VQA system to reason about image content. We propose newevaluation schemes that compensate for over-represented question-types and makeit easier to study the strengths and weaknesses of algorithms. We analyze theperformance of both baseline and state-of-the-art VQA models, includingmulti-modal compact bilinear pooling (MCB), neural module networks, andrecurrent answering units. Our experiments establish how attention helpscertain categories more than others, determine which models work better thanothers, and explain how simple models (e.g. MLP) can surpass more complexmodels (MCB) by simply learning to answer large, easy question categories.
机译:在视觉问题解答(VQA)中,算法必须回答有关图像的基于文本的问题。自2014年末以来已创建了多个VQA数据集,但它们的内容和算法评估均存在缺陷。结果,评估分数被夸大,并且主要通过回答更简单的问题来确定,因此很难比较不同的方法。在本文中,我们使用新的数据集分析了现有的VQA算法。它包含超过160万个问题,分为12个不同类别。我们还介绍了对于给定的图像强制VQA系统推理图像内容没有意义的问题。我们提出了一种新的评估方案,该方案可以弥补过度代表的问题类型,并使研究算法的优缺点更加容易。我们分析了基线和最新VQA模型(包括多模式紧凑型双线性池(MCB),神经模块网络和递归应答单元)的性能。我们的实验确定了注意力如何帮助您比其他人更好地确定类别,确定哪种模型比其他模型更有效,并解释简单的模型(例如MLP)如何通过简单地学习回答大型且容易的问题类别来超越更复杂的模型(MCB)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号